t 



OIPE 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/824,178 



DATE: 04/26/2001 
TIME: 22:18:59 



INPUT SET: S36619.raw 



I his Raw Listing contains the General 
Information Section and up to the first 5 pages. 



(ii) TITLE OF INVENTION: HUMAN ENDOSULFINE GENE 
{iii) NUMBER OF SEQUENCES: 13 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Abbott Laboratories 

(B) STREET: 100 Abbott Park Road 

(C) CITY: Abbott Park 

(D) STATE: Illinois 

(E) COUNTRY: USA 

(F) ZIP: 60064-3500 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: Macintosh 

(C) OPERATING SYSTEM: System 7.0.1 

(D) SOFTWARE: MS Word 

^Vj:1-"CURRENT^ APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/824,178 

(B) FILING DATE: 

(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/779,775 

(B) FILING DATE: 



(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Priscilla E. Porembski 

(B) REGISTRATION NUMBER: 33,207 

(C) DOCKET NUMBER: 6024. US. 01 



SEQUENCE LISTING 



General Information: 



APPLICANT: 



Jean -Marc Roch 
Victoria E.S. Scott 
Kristi L. Anderson 
James P. Sullivan 




(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 708/937-0378 

(B) TELEFAX: 708/938-2623 



PAGE: 2 RAW SEQUENCE LISTING DATE: 04/26/2001 

PATENT APPLICATION US/09/824,178 TIME: 22:19:00 

INPUT SET: S36619.raw 

47 (C) TELEX: 

48 

49 

50 (2) INFORMATION FOR SEQ ID NO : 1 : 

51 (i) SEQUENCE CHARACTERISTICS: 

52 (A) LENGTH: 1103 base pairs 

53 (B) TYPE: nucleic acid 

54 (C) STRANDEDNESS : double 

55 (D) TOPOLOGY: linear 

56 (ii) MOLECULE TYPE: cDNA 

57 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
58 

59 GTGACAGGAG CCGAAGCAGC AGCGCAGGTT GTCCCCGTTT CCCCTCCCCC TTCCCTTCTC 60 

60 CGGTTGCCTT CCCGGGCCCC TTACACTCCA CAGTCCCGGT CCCGCCATGT CCCAGAAACA 120 

61 AGAAGAAGAG AACCCTGCGG AGGAGACCGG CGAGGAGAAG CAGGACACGC AGGAGAAAGA 180 

62 AGGTATTCTG CCTGAGAGAG CTGAAGAGGC AAAGCTAAAG GCCAAATACC CAAGCCTAGG 240 

63 ACAAAAGCCT GGAGGCTCCG ACTTCCTCAT GAAGAGACTC CAGAAAGGGC AAAAGTACTT 300 
• 64 TGACTCAGGA GACTACAACA TGGCCAAAGC CAAGATGAAG AATAAGCAGC TGCCAAGTGC 360 

65 AGGACCAGAC AAGAACCTGG TGACTGGTGA TCACATCCCC ACCCCACAGG ATCTGCCCCA 420 

66 GAGAAAGTCC TCGCTCGTCA CCAGCAAGCT TGCGGGGTAA CCTGAGCCCC CCTCTCCTCC 480 

67 CCTTCCTCAA CCACTGGACG TTTATATATT ATAGGCAGGG ATGAAATGGG CACCTAGTCA 540 

68 GATCTTCTCA GCTTGCTAGC CAGAAATGAC TGTGATTCTG CTGGGGGCTG CTGAGAAGGT 600 

69 AATGTAGGTT GAAAAGGGGC TCTAAGTTTA TTTATTTCGT TAGATTGACA CTTCCACACA 660 

70 CTCCCTGTAG TCCAGGTAGG GCCTAGAAAT AGGAAAGGCT AGGATTGGAT AATGCTGCAA 720 

71 ATGCTTTTTT TGTGTGAGAA ACTGGAGAGA TGTGATTTCT CCTTTTGGGA GAGAATGTCC 780 

72 CAAAATTGAT TAGGCTGAGC CTTGGGAATA GTTTGGCAGG TTTAACATCC CAAGGCTAAC 840 

73 CTAACGTAGT TGGGAAAGGT AGATTGAATG AGACATGTTT TCTGTGCTTC TAAGTGTTCT 900 

74 GTCCCTTAGG CTGCTATTGC TTCATGTTTC CATTATGGCA GGTTTAGAGA ATCCTTAAAA 960 

75 AGAAAAATTG ACTTGCTTGC CTAAAACTAC AGTGCCCCCT TAGCCTCCAT TACTTAGTAT 1020 

76 CTCTTACAGT TTGCTCTGGC TCTCAAATAA TATAAAGATT GATGAACATT ATTCACAAAA 1080 

77 AAAAAAAAAA AAAGGGCGGC GCG 1103 
78 

79 

80 (.2J— INEO RMAT I ON -FQR^-S EQ I D N0 -:-2-: ~ 



81 (i) SEQUENCE CHARACTERISTICS: 

82 (A) LENGTH: 1199 base pairs 

83 (B) TYPE: nucleic acid 

84 (C) STRANDEDNESS: double 

85 (D) TOPOLOGY: linear 

86 (ii) MOLECULE TYPE: cDNA 

87 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 
88 

89 GTGACAGGAG CCGAAGCAGC AGCGCAGGTT GTCCCCGTTT CCCCTCCCCC TTCCCTTCTC 60 

90 CGGTTGCCTT CCCGGGCCCC TTACACTCCA CAGTCCCGGT CCCGCCATGT CCCAGAAACA 120 

91 AGAAGAAGAG AACCCTGCGG AGGAGACCGG CGAGGAGAAG CAGGACACGC AGGAGAAAGA 180 

92 AGGTATTCTG CCTGAGAGAG CTGAAGAGGC AAAGCTAAAG GCCAAATACC CAAGCCTAGG 240 

93 ACAAAAGCCT GGAGGCTCCG ACTTCCTCAT GAAGAGACTC CAGAAAGGGC AAAAGTACTT 300 

94 TGACTCAGGA GACTACAACA TGGCCAAAGC CAAGATGAAG AATAAGCAGC TGCCAAGTGC 360 

95 AGGACCAGAC AAGAACCTGG TGACTGGTGA TCACATCCCC ACCCCACAGG ATCTGCCCCA 420 

96 GAGAAAGTCC TCGCTCGTCA CCAGCAAGCT TGCGGGTGGC CAAGTTGAAT GATGCTGCCC 480 

97 GGGGCTCTGC CAGATCCTGA GACTGCTTTT GCCGCTTCCC CTCCCTGCCC CACCCGGGTC 540 

98 CTGTGCTGGC TCCTGCCCCT TCAGCCAGGG GTCAGGAGGT GGCTCGGGTG TGGGCTGGAG 600 

99 AGGCAGAAGC CCTTTCCTGT TGGTGTCCCA GCACATGGAG CCCCTTGGGC TGAGCACCAA 660 



i 



PAGE: 3 RAW SEQUENCE LISTING DATE: 04/26/2001 

PATENT APPLICATION US/09/824J78 TIME: 22:19:00 

INPUT SET: S36619.raw 

100 GACCTTGAAC CTTTTTTGTT TTACCTTTTT TCCAAATAAC AGTTGGGAGA AATATCAATG 720 

101 AAATTCTGGG GGTGGGGGTG GGGCTGAAAG GGTGGGGTGG GAGATATGGA GGAGTATGAA 7S0 

102 TTAGGGGTTG GAGTTGGTAA AAACATTCCT GACTATCCTT CTTAACCACG TGGCTGATGT 840 

103 GGGGTAGGTA TGAGGGGAAG GAAGTGGAGT AGCCTAATGA AAAGGGGTTC TAGTTGAGCT 900 

104 CTGTAGATAA ATGCCTTGTT TCAGTGTGGT TGGAGACCTG GTGTCAGATA AAAGAAACTC 960 

105 CATCCGCACA GACAGATGCA AACAGCTCCT CTAGTTCTGC AGAGCTAGTT GAGAACTCAA 1020 

106 CATTAATCAT TTTAAAAAGT ACTGTCCTTG AAATAGATTT GCTGTGGGAA GAAGGGCAGT 1080 

107 GAGTGTGGGA GAAAGGAGCC GTGAGCGTGG GGAACCCCAC AGAGCCCAAA GGACTTTTTC 1140 

108 AGTATTCGAA ATAAACAAAA CAAAAACCCA TGAAAAAACC CAAAAAAAAA AAAAAAAAA 1199 
109 

110 

111 (2) INFORNATION FOR SEQ ID NO: 3: 

112 (i) SEQUENCE CHARACTERISTICS: 

113 (A) LENGTH: 117 amino acids 

114 (B) TYPE: amino acid 

115 (C) STRANDEDNESS : single 

116 (D) TOPOLOGY: linear 

117 (ii) MOLECULE TYPE: protein 

118 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
119 

120 Met Ser Gin Lys Gin Glu Glu Glu Asn Pro Ala Glu Glu Thr Gly Glu 

121 1 5 10 15 

122 Glu Lys Gin Asp Thr Gin Glu Lys Glu Gly He Leu Pro Glu Arg Ala 

123 20 25 30 

124 Glu Glu Ala Lys Leu Lys Ala Lys Tyr Pro Ser Leu Gly Gin Lys Pro 

125 35 40 45 

126 Gly Gly Ser Asp Phe Leu Met Lys Arg Leu Gin Lys Gly Gin Lys Tyr 

127 50 55 60 

128 Phe Asp Ser Gly Asp Tyr Asn Met Ala Lys Ala Lys Met Lys Asn Lys 

129 65 70 ■ 75 80 

130 Gin Leu Pro Ser Ala Gly Pro Asp Lys Asn Leu Val Thr Gly Asp His 

131 85 90 95 

132 He Pro Thr Pro Gin Asp Leu Pro Gin Arg Lys Ser Ser Leu Val Thr 

133 100 105 LIO 

114 Ser^ Ly s Lou -At-a-Giy ' . 

135 115 

136 

137 

138 (2) INFORMATION FOR SEQ ID N0:4: 

139 (i) SEQUENCE CHARACTERISTICS: 

140 (A) LENGTH: 121 amino acids 

141 (B) TYPE: amino acid 

142 (C) STRANDEDNESS: single 

143 (D) TOPOLOGY: linear 

144 (ii) MOLECULE TYPE: protein 

145 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
146 

147 Met Ser Gin Lys Gin Glu Glu Glu Asn Pro Ala Glu Glu Thr Gly Glu 

148 1 5 10 15 

149 Glu Lys Gin Asp Thr Gin Glu Lys Glu Gly He Leu Pro Glu Arg Ala 

150 20 25 30 

151 Glu Glu Ala Lys Leu Lys Ala Lys Tyr Pro Ser Leu Gly Gin Lys Pro 

152 35 40 45 



:4 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/824,178 



DATE: 04/26/2001 
TIME: 22:19:01 



INPUT SET: S36619.mw 



153 


Gly Gly Ser Asp Phe 


Leu 


Met Lys Arg 


Leu 


Gin Lys Gly Gin Lys Tyr 


154 


50 , 




55 




60 


155 


Phe Asp Ser Gly Asp 


Tyr 


Asn Met Ala 


Lys 


Ala Lys Met Lys Asn Lys 


■L D C) 


65 


70 






75 80 


157 


Gin Leu Pro Ser Ala 


Gly 


Pro Asp Lys 


Asn 


Leu Val Thr Gly Asp His 


158 


85 






90 


95 


159 


He Pro Thr Pro Gin 


Asp 


Leu Pro Gin 


Arg 


Lys Ser Ser Leu Val Thr 


160 


100 




105 




110 


161 


Ser Lys Leu Ala Gly 


Gly 


Gin Val Glu 






162 


115 




120 . 






163 












164 












165 


(2) INFORMATION FOR 


SEQ 


ID NO: 5: 







166 (i) SEQUENCE CHARACTERISTICS: 

167 (A) LENGTH: 20 base pairs 

168 (B) TYPE: nucleic acid 

169 (C) STRANDEDNESS : single 

170 (D) TOPOLOGY: linear 

171 (ii) MOLECULE TYPE: DNA 

172 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
173 

174 GAAGAGACTC CAGAAAGGGC 

175 

176 

177 (2) INFORMATION FOR SEQ ID N0:6: 

178 (i) SEQUENCE CHARACTERISTICS: 

179 (A) LENGTH: 20 base pairs 

180 (B) TYPE: nucleic acid 

181 (C) STRANDEDNESS: single 

182 (D) TOPOLOGY: linear 

183 (ii) MOLECULE TYPE: DNA 

184 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
185 

186 GAAGCAATAG CAGCCTAAGG 

187 

TBS 

189 (2) INFORMATION FOR SEQ ID NO: 7: 



190 (i) SEQUENCE CHARACTERISTICS: 

191 (A) LENGTH: 19 base pairs 

192 (B) TYPE: nucleic acid 

193 (C) STRANDEDNESS: single 

194 (D) TOPOLOGY: linear 

195 (ii) MOLECULE TYPE: DNA 

196 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7; 
197 



198 TGTGCTGGGA CACCAACAG 

199 

200 

201 (2) INFORMATION FOR SEQ ID NO: 8: 



202 (i) SEQUENCE CHARACTERISTICS: 

203 (A) LENGTH: 29 base pairs 

204 (B) TYPE: nucleic acid 

205 (C) STRANDEDNESS: single 



f 



PAGE: 5 



206 

207 

208 

209 

210 

211 

212 

213 

214 

215 

216 

217 

218 

219 

220 

221 

222 

223 

224 

225 

226 

227 

228 

229 

230 

231 

232 

233 

234 

235 

236 

237 

238 

239 

240 

241 

242 

243 

244 

245 

246 

247 

248 

249 

250 

251 

252 

253 

254 

255 

256 

257 

258 



RAW SEQUENCE LISTING DATE: 04/26/2001 

PATENT APPLICATION US/0m24,178 TIME: 22:19:01 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



INPUT SET: S36619.raw 



GGAAGGCGCC ATGTCCCAGA AACAAGAAG 2 9 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
■ (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CAAAGTCGAC TTACCCCGCA AGCTTGCTG 



(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CCAAGTCGAC TCATTCAACT TGGCCACCC 



(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear \ 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



Met 


Ser 


Ala 


Glu 


Val 


Pro 


Glu Ala Ala 


Ser 


Ala 


Glu 


Glu 


Gin Lys Glu 


1 










5 






10 






15 


•Met 


Glu 


Asp 


Lys 


Val 


Thr 


Pro Glu Lys 


Ala 


Glu 


Glu 


Ala 


Lys Leu Lys 








20 






25 










30 


Ala 


Arg 


Tyr 


Pro 


His 


Leu 


Gly Gin Lys 


Pro 


Gly Gly Ser Asp Phe Leu 






35 








40 








45 




Arg 


Lys 


Arg 


Leu 


Gin 


Lys 


Gly Gin Lys 


Tyr 


Phe 


Asp 


Ser 


Gly Asp Tyr 




50 










55 






60 






Asn 


Met 


Ala 


Lys 


Ala 


Lys 


Met Lys Asn 


Lys 


Gin 


Leu 


Pro 


Thr Ala Thr 


65 










70 






75 






80 


Pro 


Asp 


Lys 


Thr 


Glu 


Val 


Thr Gly Asp His 


He 


Pro 


Thr 


Pro Gin Asp 










85 






90 








95 


Leu 


Pro 


Gin Arg 


Lys 


Pro 


Ser Leu Val 


Arg 


Ser 


Lys 


Leu 


Ala Gly 



SEQUENCE VERIFICATION REPORT 

PATENT APPLICATION US/09/824,178 



DATE: 04/26/2001 
TIME: 22:19:01 



Original Text 



INPUT SET: S36619.mw 



SEQUENCE MISSING ITEM REPORT 
PATENT APPLICATION US/09/824,178 



DATE: 04/26/2001 
TIME: 22:19:01 



INPUT SET: S36619.raw 



< < THERE ARE NO ITEMS MISSING > > 



PAGE: 1 SEQUENCE CORRECTION REPORT DATE: 04/26/2001 

PATENT APPLICATION US/09/824,178 TIME: 22:19:01 

INPUT SET: S36619.raw 



Line 
42 



Original Text 

(C) DOCKET NUMBER: 6024.US.Ol 



Corrected Text 

(C) REFERENCE/DOCKET NUMBER: 6024.US.Ol 



